Text Line detection and Segmentation in Handwritten Gurumukhi Scripts
نویسندگان
چکیده
Gurumukhi script is a two dimensional composition of symbols with connected and disconnected diacritics. Handwritten Gurumukhi script has some complexities like connected, overlapped text lines. It is one of the major reasons for errors during the recognition process. Text line segmentation is a challenging job in unconstrained writer independent handwritten document image processing. There is a great need for research in the area of Punjabi handwriting recognition to resolve challenging problems involved in it. In this paper we have proposed an algorithm for text line segmentation in handwritten Punjabi document that deals with the problems like overlapped and connected components in text line and extract text lines from handwritten document image. The text line detection algorithm is based on locating the most favourable segments of text line and associating it with its respective text line inserting a gap between neighbouring text lines. Keywords— Text line segmentation, overlapped text lines, connected text lines, average height, Gurumukhi script
منابع مشابه
Review: A Literature Survey on Text Segmentation in Handwritten Punjabi Documents
Gurumukhi script is used for Punjabi language, which is a two dimensional composition of symbols with connected and disconnected diacritics. Handwritten Gurumukhi script has some complexities like connected, overlapped text lines, words and characters. It is one of the foremost issues for errors during the recognition process. Text segmentation is a challenging job in unconstrained writer indep...
متن کاملA New Algorithm for Detecting Text Line in Handwritten Documents
Curvilinear text line detection and segmentation in handwritten documents is a significant challenge for handwriting recognition. Given no prior knowledge of script, we model text line detection as an image segmentation problem by enhancing text line structure using a Gaussian window, and adopting the level set method to evolve text line boundaries. Experiments show that the proposed method ach...
متن کاملAn Approach to GUI Identification for Printed Gurumukhi and English Text
Optical Character Recognition system is used to recognize printed and handwritten alphanumeric text from input image. A numerous of methods have been published based on optical character recognition. In proposed work expansion of optical character recognition to recognize multi-scripts is done which in infancy. Such type of expansion is crucial in India where each state has diverse language. Th...
متن کاملPrinted Text Recognition System for Multi-Script Image
Optical Character Recognition system provides transformation of input text into editable form. Multi-script recognition systems are requisite in the countries like India where different people speak different languages in numerous states of country. In the recent time, multi-script recognition is a demanding problem and research work for expansion of optical character recognition scheme for cla...
متن کاملProblems and Review of Line Segmentation of Handwritten Text Document
Optical character recognition (OCR) is a very popular research area since 1950's. Many people has done a lot of work on various scripts. Line segmentation is a very important step in OCR as the accuracy of the recognition algorithm highly depends on the correct line segmentation. Incorrect line segmentation not only decreases the accuracy but also may lead to some other errors. The objective of...
متن کامل